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ABSTRACT 

The deepest extragalactic X-ray observation, the 2 Ms Chandra Deep Field North (CDF-N), resolves ^80% 
of the total extragalactic cosmic X-ray background (CXB) in the 1-2 keV band. Recent work has shown 
that 70% of the remaining CXB flux is associated with sources detected by the Hubble Space Telescope (HST). 
This paper uses the existing CDF-N data to constrain the X-ray flux distribution of these X-ray-undetected HST 
sources by comparing the number of 0.5-2 keV X-ray counts at the HST positions to those expected for model 
flux distributions. In the simple case where all the undetected HST X-ray sources have the same 0.5-2 keV flux, 
the data are best fit by 1.5-3 counts per source in 2 Ms, compared to a detection limit (at 10% completeness) 
of 9 counts. Assuming a more realistic power-law logA^-logS distribution [N(> S) oc S~ a ], the data favor a 
relatively steep flux distribution, with a = l.l^ (limits are 99% confidence). This slope is very similar to that 
previously found for faint normal and starburst galaxies in the CDF-N. These results suggest deeper Chandra 
observations will detect a new population of faint X-ray sources, but extremely deep exposures are needed to 
resolve the remainder of the soft CXB. In the most optimistic scenario, when the HST sources have the flattest 
allowed flux distribution and all the sources without HST counterparts are detected, observations 5 times more 
sensitive than the existing ones would resolve at most ^60% of the remaining soft CXB. 
Subject headings: methods: data analysis — X-rays: diffuse background — X-rays: galaxies 



1. INTRODUCTION 

Deep X-ray observations, in particular the 1 and 2 Ms 
Chandra Deep Fields North and South (CDF-N and CDF-S), 
have resolved most of the extragalactic cosmic X-ray back- 
ground (CX B) at energies E < 4 keV into discrete X-ray 
sources (see iBrandt & Hasingerl 120051 for a review). The 
spectrum of the still -unresolved CXB in the CD Fs was di- 
rectly measured by iHickox & Markevitchl d2006l hereafter 
HM06), who found that 23%±3% of the CXB in the 1-2 
keV band remained unresolved. There is also an unresolved 
component at energies E > 2 keV, but the uncertainties are 
larger. In the 1-2 keV band, most of the unresolved flux can 
be accounted for by galaxies that are detected in deep Hubble 
Space Telescope (HST) observations, but are too faint to be 
detected as individua l X-ray sources. A stacking analysis by 
IWorsley et all ((2006) showed a significant contribution from 
X-ray-undetected HST sources. Hickox & Marke vitchl (120071 
hereafter HM07) measured the remaining CXB spectrum af- 
ter excluding all sources detected by HST or by the Infrared 
Array Camera (IRAC) on the Spitzer Space Telescope, and 
found that only 7% ±3% of the total 1-2 keV CXB remained. 

There has been significant interest in using deeper Chan- 
dra observations to resolve even more of the CXB, especially 
since longer Chandra exposures might probe a new popula- 
tion of sources. A simple extrapolation of the total observed 
log/V-logS distribution [which, fitted by a power law of the 
form N(> S) oc S~ a , has a ~ 0.7] falls far short of account- 
ing for the entire 1-2 keV CXB. If the remaining signal is 
due to discrete so urces, a n upturn in the log N — log S at low 
fluxes is required (HM06). Evidence for a possible upturn in 
the log/V-logS distribution at So.5-2keV ~ 10~ 17 ergs s" 1 was 
found throug h a fluctuation analysis using the first 1 Ms of 
CDF-N data dMiyaji & Griffithsll2002h . 

These faint objects, with fluxes less than the faintest CDF-N 

1 Also at Space Research Institute, Russian Academy of Sciences, Prof- 
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sources (So.5-2keV = 2-4 x 10~ 17 ergs cm -2 s -1 ) are likely star- 
burst and normal galaxies, unlike the majority of the detected 
X-ray sources in t he CDFs, of which >75% are active galac- 
tic nuclei (AGNs). iBauer et all (120041 hereafter B04) divided 
the CDF X-ray sources into galaxy and AGN subsets, based 
on their X-ray and optical properties, and produced separate 
log/V-logS fits to each using power law models. AGNs have 
a shallow logAZ-logS slope with a ~ 0.6, while the galaxies 
have a much steeper distribution with a ~ 1.3. The galaxies 
may dominate the source counts at fluxes < 10~ 17 ergs s" 1 , 
and could produce the upturn in the log/V-logS distribution 
at low fluxes. 

The goal of this paper is to constrain the distribution of X- 
ray fluxes for the HST sources that are cumulatively responsi- 
ble for most of the unresolved 1-2 keV CXB. This study puts 
limits on the log/V-logS distribution only for objects with 
HST counterparts, not all unresolved X-ray sources (which 
would require a fluctuation analysis without the advantage 
of positional information). However, since X-ray-undetected 
HST sources account for ^70% of the unresolved 1-2 keV 
CXB (or ~16% of the total CXB), this analysis provides use- 
ful limits on the properties of X-ray sources fainter than the 
current CDF-N limit and puts constraints on what might be 
observed with longer Chandra exposures. 

2. OBSERVATIONS 

The CDF-N consists of 20 separate Chandr a ACIS-I ob- 
servations with a total exp osure time of ~2 Ms (IBrandt et al.1 
1200 It [Alexander et al.l2003l hereafter A03). For this analysis, 
we utilize the coadded image of the C DF-N in the 0.5-2 keV 
that was used for source detection by |A03| (see their Fig. 3). 
The ACIS pixel scale is 0.492", and the effective aimpoint 
(the exposure weighted centroid of the 20 slightly offset ob- 
servations) is (a, S, J2000) 12:36:45.9, +62:13:58. We also 
use the 0.5-2 keV exposure map shown in Fig. 5 of lA03l 2 The 
image has a total on-axis exposure time of 1 .94 Ms, although 
the effective exposures at certain positions within our region 
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of interest are as low as 1 .2 Ms because of the chip ga ps. The 
on-axis exposure is ^2 times longer than that used in HM07, 
in which we filtered the data extensively for background flares 
(see § ED. 

The nominal flux limit of the lA03l catalog is 2.4 x 10" 17 
ergs s" 1 , or w 9 counts in 1.94 Ms. This corresponds roughly 
to the flux of the faintest detected sources, for which the 
source detection is highly incomplete. The completeness of 
the CDF-N source detection depen ds on measured source 
counts, as shown in Fig. 2 (a) of B04. Within 2' from the aim- 
point, the completeness is near zero for < 8 observed counts, 
and rises from ~10% for 10 counts to ~90% for 20 counts. 
We use this completeness curve in our modeling of the unde- 
tected X-ray sources. 

The HST catalog of sources is the same as we used in lHM07l 
and comes fro m the Great Observa tories Origins Deep Sur- 
vey (GOODS; iDickinson et al.ll2003l) . Data were taken us- 
ing the Advanced Camera for S urveys (ACS) on HST in the 
^435, V606, '775' an d ^850 bands (Giavalisco et al. 2004]). We 
use the public catalog 3 for the z&so band (sampling wave- 
lengths ^8300-9500 A), which has an approximate magni- 
tude limit of zx w — 27 (AB) and nominal positional uncer- 
tainty of wQ.l". IhM07| also excluded Spitzer IRAC sources, 
but 90% of these also had HST counterparts, so we do not 
consider them here. 

We slightly shift the positions of the HST sources to best 
match the positions of the brightest 100 |A03| X-rav sources. 
This small offset of -0.11" in a and -0.28" in S serves to 
register the coordinate frames to ~0.1" within 2' of the X-ray 
aim point. The positional uncertainties of the HST sources are 
significantly smaller than the ACIS pixel scale and so will not 
affect our results. 

3. ANALYSIS 

IWorsley et all (120061) and IHM07I sho wed t hat HST sources 
that do not have X-ray detections in the lA03l catalog nonethe- 
less contribute significant flux to the CXB. Although we can- 
not measure X-ray fluxes for these sources individually, the 
distribution of observed counts at the source positions can 
provide a useful constraint on their underlying X-ray flux dis- 
tribution. 

3.1. Measurement of the observed counts distribution 

To measure this counts distribution, we extract 0.5-2 keV 
photons at the HST source positions in circular regions of ra- 
dius ^o, which is an approximation of the 90% point-spread 
function (PSF) energy enclosed radius and varies as 4 : 



r 90 = 1"+1O"(0/1O') \ 



(1) 



where 9 is the off-axis angle. Because rgo increases with 9, 
at large angles many of the HST source regions begin to over- 
lap. Within a 1' radius of the aimpoint, the regions overlap 
over only 9% of their area, while within a 4' radius, 28% of 
the area overlaps. To include sufficient numbers of sources 
but still have minimal overlap, we confine our analysis to a 
small region of radius 2.2' around the aimpoint, for which rgo 
is between 2 and 3 ACIS pixels (1" < rgo < l."5) and over- 
lap is only 12%. Using a model of the Chandra PSF as de- 
scribed in § 13.21 we find that for sources within the 2.2' radius, 

2 For CDF-N images and exposure maps from IA031 see 
http://www.astro.psu.edu/users/niel/hdf/hdf-chandra.html 
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FIG. 1. — Histogram of background-subtracted 0.5-2 keV source counts 
inside the 2184 HST source regions (black crosses). Solid lines show pre- 
dictions for models with "delta-function" flux distribution, consisting only of 
sources with fluxes of 0.5, 2, and 4 counts, respectively, per 2 Ms. 



the flux scattered outside the ryo circles is 9.9%, accounting 
for the source overlap. In order to exclude the flux from de- 
tected X-ray sources, we only consider those HST sources that 
are o utside large exclusion regions of 4.5rc)o-9r9o around the 

_____ 

sources, which exclude > 99.9% of the flux from these 
sources (see § 3.2 of|HM Q6|)- We also exclud e one extended 
source from the catalog of lBauer et alJ d2002l) . with an exclu- 
sion region of radius 0.75', or 1.5 times the measured source 
extent. This leaves Nhst = 2184 HST sources over an area of 
12.2 arcmin 2 . 

We focus our analysis o n the 0. 5-2 keV band, as opposed 
to the 1-2 keV band as in IHM07L for two reasons . Fir st, it 
allows us to use the published CDF-N image from lA03l ex- 
actly the same image used for source detection. Second, it 
enables straightforward comparisons between our results and 
previous measurements of the soft X-ray log TV- log S, which 
usually are measured for 0.5-2 keV. One drawback of the 
0.5-2 keV band is that it also includes significant diff use sig - 
nal at E < 1 keV from local and Galactic emission (HM06); 
however, this diffuse component can be accurately subtracted 
from the fluxes measured in the small HST source regions 
(see below), and so does not significantly affect the measured 
counts distribution. We assume that the fraction of the extra- 
galactic CXB resolved by HST sources is the same in the 0.5- 
2 and 1-2 keV bands, since faint X-ray sources tend to have 
power law spectra with photo n index T ~ 1-4, s imilar to the 
to tal extra galactic CXB (e.g.. iRosati et al.ll2002l see Fig. 13 
of lHM06h . 

A key element in this analysis is the background sub- 
traction, since the background not associated with the HST 
sources is ^4 times larger than the source flux within the 
small source regions. Our background-subtraction procedure 
is different from that used in |HM06| and IHM071 Here the 
background consists of (1) local and Galactic diffuse sky 
emission, (2) flux from faint unresolved X-ray sources that 
do not have HST counterparts and whose positions we as- 
sume are random in the field, and (3) the instrumental back- 
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FIG. 2. — (a) Simulated photon probability map and (b) corresponding 
sky image, for a model in which all HST sources with X-ray counterparts 
have average fluxes of 2 counts per 2 Ms exposure. The extraction regions of 
radius rgg around all X-ray sources are shown in (b). The large empty regions 
within the area of radius 2.2' are the excluded regions around detected X-ray 
point and extended sources. 



gr ound, w ith both quiescent and flaring components (see § 4 
of lHMOfjl) . The full 1.94 Ms image from lA03l includes some 
flux from background flares, which can be elimi nate d using 
detailed light-curve analysis, as we did in IHM06I and IHM071 
Here, in order to maximize the available exposure time we 
do not clean for flares. The flaring background is distributed 
nearly evenly across the X-ray image, so we can treat it as a 
component of the total diffuse background. 

All the above components of the background are dis- 
tributed relatively smoothly across the field of view, so we 
can accurately estimate the intensity and spatial distribution 
of the backgrou nd by performing a wavelet decomposition 
dVildilininet al.il 19981) on the 0.5-2 keV image itself. The 
small-scale wavelet components include all detectable X-ray 
sources, while the largest scale (40") component approxi- 
mates the background. This component also includes sig- 
nal from the X-ray-undetected HST objects, whose fluxes we 



want to analyze. To remove this signal, we measure the num- 
ber of 0.5-2 keV counts outside the HST and X-ray exclu- 
sion regions but inside the 2.2' circle, and then normalize the 
large-scale wavelet image to match the surface brightness in 
this area. We approximately account for the fact that 9.9% 
of the flux from the HST sources is scattered outside the rgo 
source regions. This normalization is given by 

I b -0.Q99(I H st-M hst I b /M b ) 

Nb = M~ B ' (2) 

where Ib and Mb are the number of counts in the regions out- 
side the HST and X-ray source regions and Ihst and Mhst are 
the total counts inside the HST regions, in the image (/) and 
uns ealed wavelet background model (M), respectively. For 
the[A03|0.5-2 keV image, I B = 14,691, M B = 15,584, I HST = 
4806, and M HST = 4175 counts, resulting in N B = 0.937 ± 0.08 
(la statistical error). 

Subtracting this background leaves Ihst - NbMhst = 894 
net counts associated with the HST sourc es. Using the av- 
erage counts-to-flux conversion from |A03| for sources within 
2.2' of the aimpoint, this corresponds to a 0.5-2 keV surface 
brightness of ~8 x 10~ 13 ergs cm" 2 s" 1 deg~ 2 over the 12.2 
arcmin 2 area. Assuming a T = 1 .4 power-law spectrum, this 
agrees well with the 1-2 keV brightnes s of 5.7 x 10~ 13 ergs 
cm " 2 s _1 deg- 2 fortheHSr sources from HM07, derived from 
a completely different procedure. 

To derive the distribution of source counts, we measure the 
counts in each of the 2184 HST source regions after subtrac- 
tion of the "diffuse" background model and produce the dis- 
tribution shown in Fig. Q] We include bins up to 15 source 
counts, which corresponds to the ~70% completeness limit 
for the source detection (B04). Although the histogram bins 
with >13 counts contain no sources, we include them because 
they help constrain some model flux distributions; the exact 
upper bound does not strongly affect our results. 

Because many HST regions overlap, the total number of 
counts in the histogram is 1048, larger than the total of 894 
counts mentioned above. We also determine the average ex- 
posure time for each HST region, which varies from ~1.2 Ms 
in the chip gaps to 1.94 Ms on axis, with a mean value of 
(fexp) — 1.8 Ms; this is used below. 

3.2. Comparison to logN— logS models 

To constrain the X-ray flux distribution of the undetected 
HST sources, we now compare the observed counts histogram 
to the predictions for various model log N— log S curves. Pois- 
son fluctuations cause the observed counts distribution to dif- 
fer from the underlying flux distribution. In addition, we can- 
not simply evaluate predictions of the logiV-logS models 
analytically, because many source regions overlap and share 
the same photons, so we must account for the spatial dis- 
tribution of the HST sources and the variation in the Chan- 
dra PSF Therefore, we produce simulated X-ray images for 
each model logA^-logS distribution and then derive count 
histograms that we can directly compare to the CDF-N his- 
togram. For each model log N — log S, we use the following 
procedure: 

1. We produce a model flux distribution in units of 0.5- 
2 keV counts in 2 Ms (2 Ms rather than 1.94 or 1.8 
Ms is used for simplicity). The fluxes S range from 
0.05 to 30 counts per 2 Ms (which extends sufficiently 
above the source detection limit) in ng ux = 600 bins 0.05 
counts wide. Our final results are not highly sensitive 
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FIG. 3. — Probability map similar to Fig. [2a), but for a power-law logiV — 
log S model with a = 1 . 



to the flux bounds or bin size. We define N src (S) to be 
the number of sources in each bin of flux S in the 12.2 
arcmin 2 area. 

2. We normalize the flux distribution to produce, on aver- 
age, the observed 894 total source photons in the HST 
regions. The lcr statistical uncertainty in this value is 
±80 counts, mostly owing to fluctuations in the to- 
tal counts before subtraction of background, but also 
includes uncertainty in the background normalization 
(Eqn. 0, which corresponds to ±31 counts. The er- 
ror on the total number of source counts corresponds 
directly to uncertainty in the normalization of the unre- 
solv ed lo g N— log S distribution. However, as we show 
in § 15.41 statistical fluctuations do not affect our con- 
straints on the shape of the log N— log S distribution, so 
for technical simplicity we ignore this uncertainty when 
constraining the logA^-logS slope. 

In normalizing the model logA^-logS distribution, we 
account for the scattering of flux outside the rgo regions, 
as well as the probability that a source of a given flux 
would be detected in the CDF-N and not associated 
with the "unresolved" HST sources. For a given flux 
S, the average number of observed source counts inside 
the source regions is 

S9o(S) = 0.9 fffU (3) 
2 Ms 

The average total number of counts, including back- 
ground, in each rgo circle is 

S t JS) = S 90 (S)+(S B ), (4) 

where (Sb) = 2.03 is the mean number of background 
counts per source region. For the purposes of normal- 
izing the model logA^-logS only, we assume that the 
exposure time and number of background counts in all 
the HST source regions are equal to (/ eX p) and (Sb), al- 
though they vary by up to 30%. As we show below, this 
gives a sufficiently accurate normalization to match the 
number of observed counts. 



In the observed image, we have excluded detected X- 
ray sources using much larger regions of radii > 4.5r9o 
to avoid PSF-scattered flux. To make the models con- 
sistent with this exclusion, we must remove sources 
fro m the simulated image that would be "detected" in 
the lA03l catalog. For a given source flux, there is a prob- 
ability (given by Poisson scatter and source detection 
completeness) that a source will be detected and thus 
removed from the image; we estimate this probability 
and include it in the normalization as follows. 

For a given S and (Sb), the number of observed counts 
is determined by a Poisson distribution with mean S (ot . 
For j total counts, the measured source counts will be 
Sj = j- (Sb)- Here we define f c (Sj) to be the probability 
of detecting a source with Sj source counts within an 
r<)Q circle. We estimate f c from the lB04l completeness 
curve (which is for aperture-corrected total counts), by 
multiplying their x-axis by 0.9. 

On average, the number of counts contributed by a 
source with flux S to the total source counts in the rgo 
regions around undetected sources is Sgo/nondet) where 

oo 

Y, s j p pU,s tot )[i-f c (Sj)] 

/nondetOS , (Sb)) = ~ 33 j (5) 

7=0 

and where Pp(J,Stot) is the Poisson probability of ob- 
serving j counts for a mean of Stot- 

On average, the total number of background-subtracted 
source counts in all the rgo regions associated with X- 
ray-undetected HST sources is therefore 

nam 

Csrc = y^Ar src (S;)S9o(S,0/nondet(S/, (Sb))- (6) 
i=l 

We normalize the model flux distribution N STC (S) so that 
C src = 894. 

We stress that this normalization is not a free parame- 
ter that we fit to the observed histogram. Instead, we 
normalize the model logA^-logS before producing the 
simulated images, so that on average, they match the 
total number of source photons observed in the sky im- 
age. This normalization approach proves sufficient; the 
resulting simulated images (that properly include vari- 
ations in the exposure, background, and PSF, as de- 
scribed in the following section) produce, on average, 
886 counts from "undetected" HST sources, within 1 % 
of the target value of C src = 894. We stress again that 
uncertainty in this total of 894 counts does not affect 
our constraints on the log N — log S shape, as we show 
in§El 

3. If the normalized flux distribution contains more than 
Nhst = 2184 X-ray sources between 0.05 and 30 counts 
(that is, more than the number of X-ray-undetected HST 
sources), we truncate the distribution at the faint end 
and renormalize it so that 

^/V src (S,)=AW, (7) 
i=i 
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FIG. 4. — Same as Fig.ff] with models for power-law log N — log 5 distribu- 
tions. 



while keeping C src = 894 as per Eqn.|6] This effectively 
sets a minimum flux for the simulated X-ray sources. 
For the power-law models described below, this is re- 
quired for slopes a > 1. The cutoff is always at < 0.3 
counts per 2 Ms and does not significantly affect the fi- 
nal model fits (we might have used a more sophisticated 
logA^-logS distribution shape with a natural low-flux 
cutoff, but it is not warranted by the data). If the total 
number of sources turns out to be less than Nhst, we do 
not adjust the model flux distribution, since not all HST 
sources are necessarily X-ray sources. 

4. We randomly assign fluxes to the 2184 HST positions 
from the distribution given by the normalized N SIC {S). 
For each source, we use a model of the Chandra PSF 
(from the Chandra CALDB) to create a 64 x 64 pixel 
X-ray probability map centered on the HST position, 
normalized to the model X-ray flux for that source and 
t exp at the source position. The two-dimensional PSF 
model averages over the 20 constituent CDF-N obser- 
vations that have different pointings (and thus off-axis 
angles) and exposure times. 

5. We use the above model to create a simulated image of 
the sky. We produce Poisson realizations of the normal- 
ized background model (described in § 13. U and of the 
probability maps for the sources, and add these realiza- 
tions to create a simulated photon image. 

In the process of creating this image, we eliminate "de- 
tectable" sources. Before adding each source to the im- 
age, we derive the total observed (post-Poisson scatter) 
source plus background counts / in the r^ circle. This 
corresponds to I — Sb "measured" counts, where Sb is 
the value of the background model in the source region. 
Based on the probability fdl-Ss) that this number of 
observed counts is detectable, we randomly assign the 
source as "detected" or "undetected", and do not add 
detected sources to the simulated image. 

We repeat steps 4 and 5 1000 times in order to average 
the random fluctuations in each simulated image. For each 
of these 1000 simulated sky images, we derive the number 
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FIG. 5. — Variation in the C-statistic above the minimum (AC) vs. power- 
law slope a of the source flux distribution. The solid line shows the results 
using all histogram bins, while the dashed line shows results for only those 
bins with < 9 counts. The upper dotted line corresponds to the 99% confi- 
dence limit (AC = 6.6). 



of counts within r^o for each of the HST source regions, and 
produce a histogram of the background-subtracted counts, ex- 
actly as for the sky image. We average these 1000 histograms 
to produce a predicted counts distribution for this log log S 
model and compare it to the observed one. 

In comparing to the observed counts histogram, the \ 2 test 
is not appropriate because some of the bins have less than 
five sources, so the uncertainties are not Gaussian. Therefore, 
we calculate the best-fit mode l and para meter confidence in- 
tervals using the C-statistic dCashl Il979h . which is valid for 
fits to bins with low numbers of counts and Poisson errors. 
The probability distribution of AC, which is the difference 
between C for the true value of the free parameter and its 
measured minimum in C, is similar to the \ 2 distribution for 

1 degree of freedom, which we ve rify f or our particular case 
using a simulation (described in § 15 41 >. Thus, a 99% confi- 
dence interval for one parameter corresponds to AC ~ 6.6. 
To calculate a rough goodness of fit (which is not determined 
by the C-statistic), we also estimate x 2 , using errors (as shown 
in Fig.Q]) given by a„ = (n + 0.75) 1//2 + 1, which is an a pproxi- 
matio n of Poisson uncertainties for n sources per bin dGehrelsl 
fl986h . 

4. RESULTS 

4.1. Delta-function flux distribution 

As a simple but illuminating exercise, we begin with a 
"delta function" flux distribution, such that all the X-ray- 
undetected HST sources have either a single X-ray flux or zero 
flux [that is, Nsrc(Si) = for all but one 5,]. We produce model 
distributions for 0.5, 1, 1.5, 2, 3, and 4 photons per source per 

2 Ms. A sample photon probability map and corresponding 
simulated image are shown in Fig. 12 and some of the result- 
ing histograms are shown in Fig.[T] Because of Poisson fluc- 
tuations, even this simple flux distribution produces a quite 
broad distribution in the observed source counts. 

Fig.fJJshows that if all sources below the detection threshold 
were relatively bright (with S > 3 photons in 2 Ms), this would 
produce too many HST sources with 4-8 observed counts 
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FIG. 6. — Same as Fig. [5] for different estimates of the diffuse ba ckgro und. 
The thick line shows AC vs. ot for the background as calculated in j} |3.1| (as in 
Fig. [3). The blue dashed lines show results for the same spatial distribution 
of the background, scaled by 1.022 and 0.978. The red dotted line shows 
r esult s for a spatially uniform background, with a mean value as calculated in 
§ 13.11 Varying the estimates of the background does not significantly change 
the constraints on a. 

within their rgo source regions and too few with 1-2 counts. 
Conversely, a distribution with only very faint sources (S < 1 
photon in 2 Ms) gives too few sources with 7-8 counts in the 
source regions. The data are fit well (\ 2 = 10 for 18 degrees 
of freedom) if all the undetected sources have fluxes of 1 .5-3 
photons per 2 Ms. 

4.2. Power-law flux distribution 

The actual flux distribution for the HST sources is likely 
not a delta function. Here we consider the simple case of 
a power-law distribution N(> S) oc S~ a , with a between 0.2 
(flatter than the faint-end slope of the total CDF logA^-logS) 
and 2.5 (steeper than the observed logA^-logS for galaxies; 
see §[T|). A sample photon probability map for a model with 
a = 1 is shown in Fig. [3] 

The resulting counts histograms for these models are shown 
in Fig. [4] The C-statistic is minimized at a = 1.1, for which 
the data are fit well (x 2 = 8 for 18 de grees of freedom). No- 
tably, the negative bins in Figs. Q] and [4] are well-described by 
all models, indicating that the statistical properties of the im- 
age are as expected and that we have properly accounted for 
background. The dependence of AC on a is shown by the 
solid line in Fig. [5] At 99% confidence (AC = 6.6) we can 
rule out flux distributions with a < 0.8 and a > 1.6. 

5. VERIFICATION 
5.1. Background uncertainties 

As described in § 13.11 the number of background photons 
in the HST source regions is ~4 times larger than the number 
of source photons, so our subtraction of the background is 
a key component of the analysis. Because we measure the 
spatial distribution and intensity of the background directly 
from the image itself, we expect our background model to 
be sufficiently accurate. Still, it is useful to check that the 
results do not depend strongly on our exact prescription for 
the background. 

First, we note that the data histogram shown in Figs.Q]and 
|4j to which we perform the model fits, consists of counts after 



subtraction of the background model. We use background- 
subtracted counts because they best represent the quantity 
of interest, which is the distribution of photons associated 
with the sources. Since the same background model is sub- 
tracted from both the data and the simulated images, we ex- 
pect this procedure to have minimal effect on the parameter 
constraints. To verify this, we also calculate AC using the 
histogram of total observed counts with no background sub- 
traction, and find that the confidence intervals are essentially 
identical (within w 5%) for the two fits. 

Next, we consider uncertainties in the (1) normalization and 
(2) spatial distribution of the background model. To address 
the impact of a different background normalization, we re peat 
the analysis for the logA^-logS models described in § 13.21 
but vary the overall background level by ±2.2% (or 99% con- 
fidence intervals for the statistical error in the normalization 
given by Eqn. [2]). This in turn changes the total number of 
photons associated with the HST sources (which is 894 for 
the nominal background) by ±10%. 

To address the effects of spatial variation, instead of us- 
ing a wavelet map of the background that approximates the 
chip gaps and large-scale intensity variations, we repeat the 
analysis using a flat (spatially uniform) background, normal- 
ized using Eqn. 12 which gives a background of 0.101 counts 
pixel" 1 . Each new background model produces a slightly dif- 
ferent histogram of background-subtracted counts, in addition 
to new model distributions. 

The values of AC versus a for these different background 
models are shown in Fig. [6] The variation in the background 
normalization changes the best-fit a by < 0.1. Because the 
background normalization uncertainty is statistical in nature, 
we include this error in quadrature; it has only a small effect 
on our final constraints on a. These alternative backgrounds 
give similar goodness of fit (6 < \ 2 < 1 1) to the nominal back- 
ground. 

5.2. PSF uncertainties 

We have also considered uncertainties in our model of the 
Chandra PSF, since scattered flux between source regions 
may affect the count histograms. To test this dependence, we 
repeat the calculations, changing the width of the model PSF 
by ±20%, or half the variation in rgo across our 2.2' region. 
We expect that t he PSF model is s ignificantly more accurate 
than ±20% (e.g.. I Allen etaT]l2003h . so this range is conserva- 
tive. Although we change the PSF, we still use the rgo circles 
as defined in Eqn. Q] to derive the histogram of counts in the 
HST regions. Within this range of PSF widths, the flux scat- 
tered outside the rgo regions is between ~7% and 13%, which 
in turn affects our estimate of the background normalization 
Nb (as in Eqn.|2|i by a small amount (~0.0015), as well as our 
normalization of the model logA^-logS (as in Eqns.[3H6]l. 

Properly accounting for these variations, we find that 
changing the PSF width by -20% has a negligible effect, 
while changing the PSF by +20% serves to increase the best- 
fit a by ~ 0.2. The wider PSF causes additional flux overlap 
between the individual sources, which produces more "unde- 
tected" sources with 8-15 counts and more strongly rules out 
models with low a. However, even for this conservative range 
of PSF widths, the uncertainty is still significantly smaller 
than our statistical upper bound on a. We conclude that PSF 
uncertainties have no significant effect on our constraints on 
the flux distribution. 

There is a further slight uncertainty due to the fact that when 
we add the PSF models to the simulated images, the source 
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positions are rounded to the nearest pixel, creating a posi- 
tional offset up to 0.35" between the model PSF in the image 
and the center of the source region. To verify that this does 
not affect our results, we repeated the analysis with the HST 
source regions shifted to account for this rounding offset; this 
produced negligible change in the constraints on a. 

5.3. Source detection completeness 

We further consider uncertainty in the model of the sourc e 
detection completeness of the |A03| catalog (given in B04), 
which may affect our modeling of the undetected HST sources 
with > 8 source counts. For example, some of our input 
logiV-logS models are flatter, and have more overlap be- 
tween bright sources than the model used by B04 to calculate 
the completness limits. It is not immediately clear how this 
source overlap would impact the source detection complete- 
ness. 

One simple test is to examine the variation in AC if we only 
include the bins at < 8 counts, which are not strongly affected 
by the completeness estimates. This AC curve is shown by the 
dashed line in Fig. [5] Although these fits give slightly worse 
constraints on a, they are consistent with the results from the 
full histogram. To more rigorously examine the impact of 
the completeness, we repeat the full analysis using the simple 
case of a hard flux limit; all sources with > 8 source counts in 
the rgo circle are detected, and all others are undetected (cor- 
responding roughly to the 9 count aperture-corrected limit for 
lA03l) . To perform this comparison, we use only the histogram 
bins with < 8 counts in the fits. This model gives nearly iden- 
tical confidence limits (within 0.02 in a) to the more sophisti- 
cated completeness model described above. We conclude that 
uncertainty in the detection completeness given in B04 should 
not significantly affect the results. 

5.4. Verification of the fitting procedure 

Finally, we confirm the statistical validity of our constraints 
on a. In particular, we verify that (1) AC is distributed ap- 
proximately like x 2 for 1 degree of freedom in our particular 
case, and (2) our constraints are not affected by our normal- 
ization of the model log N— log S distributions, which for com- 
putational simplicity we set to match the observed 894 source 
counts at the HST source positions (thus ignoring the ±9% la 
statistical uncertainty in this value). 

To this end, we perform a Monte Carlo simulation of the 
data and our fitting procedure, with two simplifications to al- 
low for fast computations: we do not consider source overlap 
(so that counts observed at each source position are indepen- 
dent, and we do not need to create simulated images), and we 
do not account for the detection of bright sources, only con- 
sidering sources with < 10 observed source counts. We define 
a model power-law logA^-logS distribution with our best-fit 
value of a = 1.1 and an average total flux of 1000 source pho- 
tons observed at 2500 source positions. We also include a 
constant background flux of 2 photons per source. 

We then create 1000 "observed" realizations of this flux dis- 
tribution, in which fluxes are assigned randomly to the 2500 
source positions. For each source, the observed counts are 
drawn from a Poisson distribution with a mean equal to the 
total source plus backgrou nd fl ux. We then follow the fit- 
ting procedure outlined in § 13.21 and fit each realization of the 
data with model power-law logA^-logS distributions having 
slopes in the range 0.7 < a < 1.6. For each realization, we 
normalize the model logA^-logS to produce, on average, the 
number of total observed counts in that realization, rather than 
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FIG. 7. — (a) The differential flux distribution (aW/dS) for unresolved 
HST sources, for the model power-law X-ray flux distributions allowed by 
the fits. The shaded area shows the 99% confidence intervals in power- 
law slope a and the normalization (see text for details). For comparison, 
the lines at right show the best-fit log N — log S for all CDF-N sources (thick 
gray line) and AGNs plus galaxies (dashed line) from lB04l For direct com- 
parison with our model, the AGN plus galaxy curve is multiplied by 0.76 
(see text), (b) The integral CD F \ogN — logS distribution, showing best-fit 
curv es from|Moretti et al. 1 2003, solid line) and the AGN plus galaxy fit from 
IB04I (dashed line). The gray area shows the 90% confid ence intervals from 
the flu ctuation analysis on the 1 Ms CDF-N exposure by Miyaji & Griffiths 
12002). The hatched region shows the limits from the power-law fits to the 
unresolved HST sources, with N(> S) set to equal the B04 galaxy plus AGN 
fit at So. 5-2 keV = 2.4 X 10~ 17 ergs cm -2 s . Note that our constraints pro- 
vide only lower limits to the total X-ray flux distribution, since some X-ray 
sources might not be detected by HST. 



the input average flux of 1000 source counts. In this way, we 
take into account statistical fluctuations in the total counts that 
we observe at the HST source positions. 

For each realization, we calculate C versus a, and deter- 
mine the best-fit a from the minimum in C. We find that in 
99% of cases, the best-fit value of a lies AC < 6.6 from the 
actual value of a = 1.1. This confirms that (1) AC is dis- 
tributed very similarly to x 2 f° r 1 degree of freedom, and (2) 
that statistical fluctuations in the total observed counts do not 
affect our constraints on the slope a. These fluctuations do, 
however, introduce uncertainty in the logA^-logS normaliza- 
tion, which we include in our final results. 
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6. DISCUSSION 
6.1. Constraints on the \ogN — \ogS distribution 

Our analysis shows that the distribution of photons in the 
CDF-N image associated with X-ray-undetected HST sources 
places significant constraints on their underlying flux distri- 
bution. Our final constraints on the slope are a = l.l^'f. The 
corresponding normalization of the differential flux distribu- 
tion is dN/dS = (1 .3 ± 0.3) x 10 22 sources deg" 2 (ergs cm" 2 
s" 1 )" 1 at 5o.5_2keV = 4 x 10~ 18 ergs cm -2 s" 1 (all uncertainties 
are 99% confidence). In Fig.[7](a), we show the dN/dS distri- 
butions allowed by the fits. The relatively small uncertainty at 
•Sb.5-2kev ~4x 10~ 18 ergs cm" 2 s" 1 is a result of the fact that 
we restrict our logA^-logS models to be power-law in shape. 

Our best-fit power-law model (a = 1.1) connects well with 
that of the obser ved C DF-N log./V-logS. In Fig. [7] (a) we 
compare it to the Sol fits for (1) the full CDF logN-logS, 
which has a ~ 0.7, and (2) the sum of the galaxies plus AGNs 
power laws fitted separately, which near the CDF-N flux limit 
can be approximated by a single power law with a ~ 1. While 
these log N— log S curves represent all X-ray sources, our fits 
constrain the flux distributions for HST sources only. There- 
fore, to account for the fact that only 16 out of the 21 detected 
X-ray so urces in our 2.2' circle have HST counterparts, we 
show the B04 galaxy plus AGN curve multiplied by this frac- 
tion (0.76). At the CDF-N limit, the normalizations of both 
lB04l curves are consistent with our allowed model distribu- 
tions, but the slope of the AGN plus galaxy logA^-logS more 
closely matches our best-fit model for the fainter HST sources. 
This indicates that the X-ray-unresolved HST sources might 
represent the faint end of the resolved sub-population of nor- 
mal and starburst galaxies, and that deeper Chandra observa- 
tions might resolve significant numbers of these sources. 

Fig. |7](b) shows fits to the integrated logjV -logS in the 
CDFs. Shown are with the best-fit curve from More tti et alj 
d2003l) . as well as the AGN plus galaxy fit from lB04l (which 
only included sources fainter than So.5-2keV = 2 x 10~ 16 ergs 
cm -2 s" 1 ). We also show the constraints from our fits, by 
setting N(> S) equal the |B04| galaxy plus AGN curve at 
So.5-2keV = 2.4 x 10~ 17 ergs cm" 2 s" 1 (this is necessary be- 
cause we analyze only the undetected sources). The unre- 
solved logA^-logS distributions allowed by our fits are con- 
sistent with the constraints obt ained by the fluctuation analy- 
sis on the first 1 Ms of data bv lMivaii & Griffithsl (120021) . and 
extend to lower fluxes. 

6.2. Prospects for deeper Chandra exposures 

How much of the remaining extragalactic CXB would be 
resolved by deeper Chandra exposures depends largely on 
the shape of the flux distribution for unresolved sources. For 
a shallow logA^-logS curve (i.e., low a), most of the unre- 
solved CXB would come from relatively bright sources that 
could be detected with moderately deeper observations. For a 
steeper logA^-logS, however, the bulk of the flux comes from 
much fainter sourc es. 

As measured by HM07L 70% of the unresolved (or 16% 
of the total) 1-2 keV CXB is associated with HST sources. 
Fig. [8] shows the fraction of this flux that would be resolved 
as a function of survey depth, for the power-law logA^-logS 
models used in § [3] Here we consider the CXB to be "re- 
solved" down to the flux of the faintest detected source s, ig- 
noring incompleteness in the source detection (e.g., the lA03l 
catalog is only ^10% complete at its nominal 9 count detec- 
tion limit). The models with high and low a (Fig. [8] dotted 
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FIG. 8. — The fraction of the unresolved extragalactic 0.5-2 keV CXB 
from undetected HST sources that can be resolved as a function of limiting 
Chandra flux, for power-law model log N — log S distributions. The current 
limit for 2 Ms is at the end of the ,t-axis. Models with a < 0.8 and a > 1.6 
are ruled out by the model fits (see Figs. [5]and[6}. The left y axis represents 
the flux as sociated with X-ray-undetected HST sources only (16% of the total 
soft CXB, HM07). The right y axis gives the corresponding fraction of the 
total CXB. 



lines) are ruled out by our present analysis. 

Let us consider further observations that are 5 times more 
sensitive than the current CDF-N, with a 0.5-2 keV flux limit 
of 5 x 10~ 18 ergs s _1 . Source detection in such observations 
would likely become background-limited, so this sensitivity 
would require exposures more than 5 times the existing one. 
For these exposures, the range of a allowed by the fits (0.8 < 
a < 1.6) corresponds to resolving between ^20% and 40% 
of 0.5-2 keV flux from the undetected HST sources. This 
would make up 13%— 30% of the unresolved (or 3%-7% of 
the total) soft CXB. For the best-fit power law model with 
a = 1.1, ^30% of the flux from the undetected HST sources 
would be resolved. 

Of course, the present analysis does not constrain the fluxes 
from those undetected X-ray sources that do not have HST 
counterparts. Such sources can contribute up to an additional 
~30% of the unresolved (or ~6% of the total) soft CXB. In 
the most optimistic scenario, all these sources would be de- 
tected in deep exposures, and the HST sources would have 
the flattest allowed logA^-logS slope. In this optimistic case, 
Chandra exposures 5 times more sensitive than the existing 
observations (i.e., > 10 Ms) would resolve up to ^60% of the 
remaining extragalactic soft CXB. 
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